69 research outputs found

Identifying interacting genetic variations by fish-swarm logic regression

Author: Cao Zhi
Wang Jiayin
Yan Chunxia
Yang Aiyuan
Zhang Xuanping
Zhao Zhongmeng
Zhu Feng
Publication venue: Digital Commons@Becker
Publication date: 01/01/2013

Understanding associations between genotypes and complex traits is a fundamental problem in human genetics. A major open problem in mapping phenotypes is that of identifying a set of interacting genetic variants, which might contribute to complex traits. Logic regression (LR) is a powerful multivariant association tool. Several LR-based approaches have been successfully applied to different datasets. However, these approaches are not adequate with regard to accuracy and efficiency. In this paper, we propose a new LR-based approach, called fish-swarm logic regression (FSLR), which improves the logic regression process by incorporating swarm optimization. In our approach, a school of fish agents runs in parallel. Each fish agent holds a regression model, while the school searches for better models through various preset behaviors. The swarm algorithm improves accuracy and efficiency by speeding up convergence and preventing the search from becoming trapped in local optima. We apply our approach to a real screening dataset and a series of simulation scenarios. Compared with three existing LR-based approaches, our approach achieves lower type I and type II error rates, identifies more preset causal sites, and runs faster.
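To make the idea of a logic regression model concrete, the following is a minimal sketch of how one candidate model (the kind of object each fish agent would hold) might be scored. The Boolean expression, the toy genotype matrix, and the function names are all hypothetical illustrations, not taken from the FSLR paper, and the swarm search itself is not shown.

```python
from typing import Callable, List, Sequence


def logic_model(row: Sequence[int]) -> int:
    """Hypothetical Boolean model: (SNP0 AND SNP1) OR (NOT SNP2)."""
    return int((row[0] and row[1]) or (not row[2]))


def misclassification_rate(
    model: Callable[[Sequence[int]], int],
    genotypes: List[Sequence[int]],
    phenotypes: List[int],
) -> float:
    """Fraction of samples whose predicted phenotype differs from the observed one."""
    errors = sum(model(g) != p for g, p in zip(genotypes, phenotypes))
    return errors / len(phenotypes)


# Toy data (assumed): each row holds binary indicators for three variants.
genotypes = [[1, 1, 1], [0, 1, 1], [1, 0, 0], [0, 0, 1]]
phenotypes = [1, 0, 1, 1]

rate = misclassification_rate(logic_model, genotypes, phenotypes)
print(rate)
```

A search procedure would compare this score across many candidate expressions and keep the better ones; how FSLR does that is described only at the level of the abstract above.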

Directory of Open Access Journals

Digital Commons@Becker

PubMed Central

The Missed Patient With Diabetes: How access to health care affects the detection of diabetes

Author: Beckles Gloria L.
Cheng Yiling J.
Geiss Linda S.
Gregg Edward W.
Kahn Henry S.
Zhang Xuanping
Publication venue: American Diabetes Association
Publication date: 01/01/2008

OBJECTIVE: This study examined the association between access to health care and three classifications of diabetes status: diagnosed, undiagnosed, and no diabetes.

Crossref

PubMed Central

Spiral - Imperial College Digital Repository

Query-dominant User Interest Network for Large-Scale Search Ranking

Author: Guo Tong
He Junlin
Hou Jingyou
Ke Bingqing
Li Xuanping
Liang Xiao
Wenwu
Yang Haitao
Yu Enyun
Yuan Yong
Zhang Chao
Zhang Shunyu
Publication date: 10/10/2023

Historical behaviors have shown great effect and potential in various prediction tasks, including recommendation and information retrieval. Overall historical behaviors are varied but noisy, while search behaviors are always sparse. Most existing approaches to personalized search ranking learn representations from the sparse search behaviors alone, which creates a bottleneck and does not sufficiently exploit the crucial long-term interest. User long-term interest is varied but noisy for instant search, and how to exploit it well remains an open problem. To tackle this problem, we propose a novel model named Query-dominant user Interest Network (QIN), which includes two cascade units to filter the raw user behaviors and reweigh the behavior subsequences. Specifically, we propose a relevance search unit (RSU), which first searches for a subsequence relevant to the query and then searches for the sub-subsequences relevant to the target item. These items are then fed into an attention unit called the Fused Attention Unit (FAU). It is designed to calculate attention scores from the ID field and attribute field separately, and then adaptively fuse the item embedding and content embedding based on the user engagement of the past period. Extensive experiments and ablation studies on real-world datasets demonstrate the superiority of our model over state-of-the-art methods. QIN has now been successfully deployed on Kuaishou search, an online video search platform, and obtained a 7.6% improvement in CTR.

arXiv.org e-Print Archive

Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis.

Author: Abascal Federico
Amin Samirkumar B.
Bader Gary D.
Barenboim Jonathan
Beroukhim Rameen
Bertl Johanna
Boroevich Keith A.
Brunak Soren
Campbell Peter J.
Carlevaro-Fita Joana
Chakravarty Dimple
Chan Calvin Wing Yiu
Chen Ken
Choi Jung Kyoon
Deu-Pons Jordi
Dhingra Priyanka
Diamanti Klev
Feuerbach Lars
Fink J. Lynn
Fonseca Nuno A.
Frigola Joan
Gambacorti-Passerini Carlo
Garsed Dale W.
Gerstein Mark
Getz Gad
Gonzalez-Perez Abel
Guo Qianyun
Gut Ivo G.
Haan David
Hamilton Mark P.
Haradhvala Nicholas J.
Harmanci Arif O.
Helmy Mohamed
Herrmann Carl
Hess Julian M.
Hobolth Asger
Hodzic Ermin
Hong Chen
Hornshoj Henrik
Isaev Keren
Izarzugaza Jose M. G.
Johnson Rory
Johnson Todd A.
Juul Malene
Juul Randi Istrup
Kahles Andre
Kahraman Abdullah
Kellis Manolis
Khurana Ekta
Kim Jaegil
Kim Jong K.
Kim Youngwook
Komorowski Jan
Korbel Jan O.
Kumar Sushant
Lanzos Andres
Larsson Erik
Lawrence Michael S.
Lee Donghoon
Lehmann Kjong-Van
Li Shantao
Li Xiaotong
Lin Ziao
Liu Eric Minwei
Lochovsky Lucas
Lou Shaoke
Madsen Tobias
Marchal Kathleen
Martincorena Inigo
Martinez-Fundichely Alexander
Maruvka Yosef E.
Mas-Ponte David
McGillivray Patrick D.
Meyerson William
Muinos Ferran
Mularoni Loris
Nakagawa Hidewaki
Nielsen Morten Muhlig
Paczkowska Marta
Park Keunchil
Park Kiejung
Pedersen Jakob Skou
Pich Oriol
Pons Tirso
Pulido-Tamayo Sergio
Raphael Benjamin J.
Reimand Juri
Reyes-Salazar Iker
Reyna Matthew A.
Rheinbay Esther
Rubin Mark A.
Rubio-Perez Carlota
Sabarinathan Radhakrishnan
Sahinalp S. Cenk
Saksena Gordon
Salichos Leonidas
Sander Chris
Schumacher Steven E.
Shackleton Mark
Shapira Ofer
Shen Ciyue
Shrestha Raunak
Shuai Shimin
Sidiropoulos Nikos
Sieverling Lina
Sinnott-Armstrong Nasa
Stein Lincoln D.
Stuart Joshua M.
Tamborero David
Tiao Grace
Tsunoda Tatsuhiko
Umer Husen M.
Uuskula-Reimand Liis
Valencia Alfonso
Vazquez Miguel
Verbeke Lieven P. C.
von Mering Christian
Wadelius Claes
Wadi Lina
Wang Jiayin
Warrell Jonathan
Waszak Sebastian M.
Weischenfeldt Joachim
Wheeler David A.
Wu Guanming
Yu Jun
Zhang Jing
Zhang Xuanping
Zhang Yan
Zhao Zhongming
Zou Lihua
Publication venue: Commun Biol
Publication date: 01/01/2020

Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis

Repository for Publications and Research Data

DSpace@MIT

Lund University Publications

Publikationer från Uppsala Universitet

Ghent University Academic Bibliography

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.

Author: Abascal Federico
Akdemir Kadir C.
Alvarez Eva G.
Amin Samirkumar B.
Bader Gary D.
Baez-Ortega Adrian
Bandopadhayay Pratiti
Barenboim Jonathan
Beroukhim Rameen
Bertl Johanna
Boroevich Keith A.
Boutros Paul C.
Bowtell David D. L.
Brors Benedikt
Brunak Soren
Burns Kathleen H.
Busanovich John
Campbell Peter J.
Carlevaro-Fita Joana
Chakravarty Dimple
Chan Calvin Wing Yiu
Chan Kin
Chen Ken
Choi Jung Kyoon
Cortes-Ciriano Isidro
Craft David
Deu-Pons Jordi
Dhingra Priyanka
Diamanti Klev
Dueso-Barroso Ana
Dunford Andrew J.
Edwards Paul A.
Estivill Xavier
Etemadmoghadam Dariush
Feuerbach Lars
Fink J. Lynn
Fonseca Nuno A.
Frenkel-Morgenstern Milana
Frigola Joan
Gambacorti-Passerini Carlo
Garsed Dale W.
Gerstein Mark
Getz Gad
Gonzalez-Perez Abel
Gordenin Dmitry A.
Guo Qianyun
Gut Ivo G.
Haan David
Haber James E.
Hamilton Mark P.
Haradhvala Nicholas J.
Harmanci Arif O.
Helmy Mohamed
Herrmann Carl
Hess Julian M.
Hobolth Asger
Hodzic Ermin
Hong Chen
Hornshoj Henrik
Hutter Barbara
Imielinski Marcin
Isaev Keren
Izarzugaza Jose M. G.
Johnson Rory
Johnson Todd A.
Jones David T. W.
Ju Young Seok
Juul Malene
Juul Randi Istrup
Kahles Andre
Kahraman Abdullah
Kazanov Marat D.
Kellis Manolis
Khurana Ekta
Kim Jaegil
Kim Jong K.
Kim Youngwook
Klimczak Leszek J.
Koh Youngil
Komorowski Jan
Korbel Jan O.
Kumar Kiran
Kumar Sushant
Lanzos Andres
Larsson Erik
Lawrence Michael S.
Lee Donghoon
Lee Eunjung Alice
Lee Jake June-Koo
Lehmann Kjong-Van
Li Shantao
Li Xiaotong
Li Yilong
Lin Ziao
Liu Eric Minwei
Lochovsky Lucas
Lopez-Bigas Nuria
Lou Shaoke
Lynch Andy G.
Macintyre Geoff
Madsen Tobias
Marchal Kathleen
Markowetz Florian
Martincorena Inigo
Martinez-Fundichely Alexander
Maruvka Yosef E.
McGillivray Patrick D.
Meyerson Matthew
Meyerson William
Miyano Satoru
Muinos Ferran
Mularoni Loris
Nakagawa Hidewaki
Navarro Fabio C. P.
Nielsen Morten Muhlig
Ossowski Stephan
Paczkowska Marta
Park Keunchil
Park Kiejung
Park Peter J.
Pearson John V.
Pedersen Jakob Skou
Pich Oriol
Pons Tirso
Puiggros Montserrat
Pulido-Tamayo Sergio
Raphael Benjamin J.
Reimand Juri
Reyes-Salazar Iker
Reyna Matthew A.
Rheinbay Esther
Rippe Karsten
Roberts Nicola D.
Roberts Steven A.
Rodriguez-Martin Bernardo
Rubin Mark A.
Rubio-Perez Carlota
Sabarinathan Radhakrishnan
Sahinalp S. Cenk
Saksena Gordon
Salichos Leonidas
Sander Chris
Schumacher Steven E.
Scully Ralph
Shackleton Mark
Shapira Ofer
Shen Ciyue
Shrestha Raunak
Shuai Shimin
Sidiropoulos Nikos
Sieverling Lina
Sinnott-Armstrong Nasa
Stein Lincoln D.
Stewart Chip
Stuart Joshua M.
Tamborero David
Tiao Grace
Torrents David
Tsunoda Tatsuhiko
Tubio Jose M. C.
Umer Husen Muhammad
Uuskula-Reimand Liis
Valencia Alfonso
Vazquez Miguel
Verbeke Lieven P. C.
Villasante Izar
von Mering Christian
Waddell Nicola
Wadelius Claes
Wadi Lina
Wala Jeremiah A.
Wang Jiayin
Warrell Jonathan
Waszak Sebastian M.
Weischenfeldt Joachim
Wheeler David A.
Wu Guanming
Yang Lixing
Yao Xiaotong
Yoon Sung-Soo
Yu Jun
Zamora Jorge
Zhang Cheng-Zhong
Zhang Jing
Zhang Xuanping
Zhang Yan
Zhao Zhongming
Zou Lihua
Publication venue: Nature
Publication date: 01/01/2020

The discovery of drivers of cancer has traditionally focused on protein-coding genes. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.
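As a rough illustration of what "combining significance levels from multiple methods" can mean, the sketch below uses Fisher's method, a classical technique that assumes the p-values are independent. The abstract does not name the combination strategy it uses, so this is a stand-in chosen for simplicity and not the authors' procedure; the function name and the example p-values are hypothetical.

```python
import math
from typing import Sequence


def fisher_combine(pvalues: Sequence[float]) -> float:
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution
    with 2k degrees of freedom under the null, and for even degrees of
    freedom the survival function has the closed form used below.
    """
    if not pvalues or any(p <= 0.0 or p > 1.0 for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")
    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # this is X / 2
    tail = sum(half_stat ** i / math.factorial(i) for i in range(k))
    return math.exp(-half_stat) * tail


# Hypothetical p-values reported for one genomic element by three methods.
combined = fisher_combine([0.01, 0.04, 0.20])
print(combined)
```

P-values produced by different driver-discovery methods on the same data are generally correlated, so an independence-based combination like this one would tend to be overconfident in practice.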

Publikationsserver der Universität Tübingen

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Repository for Publications and Research Data

DSpace@MIT

Lund University Publications

Ghent University Academic Bibliography

Publikationer från Uppsala Universitet

UCL Discovery

Copenhagen University Research Information System

eScholarship - University of California

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

University of St. Andrews - Pure

St Andrews Research Repository

Effectiveness of interventions to promote screening for diabetic retinopathy.

Author: Zhang Xuanping
Publication date: 22/09/2017

Ezid

Auditory Cryptography Security Algorithm With Audio Shelters

Author: Li Huan
Qin Zheng
Wang Xu
Zhang Xuanping
Publication venue: Published by Elsevier Ltd.
Publication date: 31/12/2011

In this paper, an auditory cryptography security algorithm with audio shelters is proposed. The meaningful audio watermark is preprocessed into high-fidelity binary audio, and the binary audio is encrypted into n cryptographic audios by a (k, n) threshold scheme. Fewer than k of the cryptographic audios reveal no information; only when k or more of them are played in synchrony can the original be heard directly. The n cryptographic audios are embedded in the corresponding n shelter audios, which are preprocessed by a high-dimensional matrix transformation. Experiments show that the proposed algorithm is practical, secure, and robust against common attacks.
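The threshold idea can be sketched with a much simpler construction than the one the abstract describes. The code below implements only the special case k = n using XOR secret sharing on a bit sequence: any n - 1 shares are uniformly random and reveal nothing, while all n together recover the secret. It is an assumed illustration of threshold sharing in general, with hypothetical function names; it does not reproduce the paper's auditory scheme, in which recovery happens by synchronized playback and k may be smaller than n.

```python
import secrets
from functools import reduce
from operator import xor
from typing import List


def split_bits(bits: List[int], n: int) -> List[List[int]]:
    """Split a bit sequence into n XOR shares (an (n, n) threshold scheme)."""
    if n < 2:
        raise ValueError("need at least two shares")
    # n - 1 shares are drawn uniformly at random.
    random_shares = [[secrets.randbelow(2) for _ in bits] for _ in range(n - 1)]
    # The last share is chosen so that the XOR of all shares equals the secret.
    last = [b ^ reduce(xor, column) for b, column in zip(bits, zip(*random_shares))]
    return random_shares + [last]


def combine(shares: List[List[int]]) -> List[int]:
    """Recover the secret by XOR-ing all shares position by position."""
    return [reduce(xor, column) for column in zip(*shares)]


secret = [1, 0, 1, 1, 0]  # stand-in for a short stretch of binary audio
shares = split_bits(secret, 3)
print(combine(shares) == secret)
```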

Elsevier - Publisher Connector

Adaptive PSO for Online Identification of Time-Varying Systems

Author: BLACKWELL T
BLACKWELL T M
ISHIGAME ATSUSHI
ZHANG XUANPING
Publication venue: Institute of Electrical Engineers of Japan (IEE Japan)
Publication date: 01/01/2011

Crossref

An Efficient Algorithm for Sensitively Detecting Circular RNA from RNA-seq Data

Author: Jiayin Wang
Xuanping Zhang
Yidan Wang
Zhongmeng Zhao
Publication venue: MDPI AG
Publication date: 01/09/2018

Circular RNA (circRNA) is an important member of the non-coding RNA family. Numerous computational methods for detecting circRNAs from RNA-seq data have been developed in the past few years, but the algorithms differ dramatically in how they balance the sensitivity and precision of detection and in their filtering strategies. To further improve sensitivity while maintaining an acceptable precision of circRNA detection, a novel and efficient de novo detection algorithm, CIRCPlus, is proposed in this paper. CIRCPlus accurately locates circRNA candidates by identifying a set of back-spliced junction reads through comparison of the local similar sequence of each pair of spanning junction reads. This strategy thus utilizes the important information provided by unbalanced spanning reads, which facilitates detection, especially when the expression levels of circRNA are low. The performance of CIRCPlus was tested and compared with existing de novo methods on real datasets as well as a series of simulation datasets with different configurations. The experimental results demonstrated that the sensitivity of CIRCPlus reached 90% in common simulation settings, while CIRCPlus maintained balanced sensitivity and reliability on the real datasets according to an objective assessment criterion based on RNase R-treated samples. The software tool is available for academic use only.

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

DelInsCaller: An Efficient Algorithm for Identifying Delins and Estimating Haplotypes from Long Reads with High Level of Sequencing Errors

Author: Geng Qiang
Jiayin Wang
Shenjie Wang
Xuanping Zhang
Publication venue: MDPI AG
Publication date: 01/12/2022

Delins, also known as complex indel, is a combined genomic structural variation formed by deleting and inserting DNA fragments at a common genomic location. Recent studies have emphasized the importance of delins in cancer diagnosis and treatment. Although the long reads from PacBio CLR sequencing significantly facilitate delins calling, the existing approaches still encounter computational challenges from the high level of sequencing errors, and often introduce errors in genotyping and phasing delins. In this paper, we propose an efficient algorithmic pipeline, named delInsCaller, to identify delins at haplotype resolution from PacBio CLR sequencing data. delInsCaller uses a fault-tolerant method that calculates a variation density score, which helps to locate the candidate mutational regions under a high level of sequencing errors. It adopts a base association-based contig splicing method, which facilitates contig splicing in the presence of false-positive interference. We conducted a series of experiments on simulated datasets, and the results showed that delInsCaller outperformed several state-of-the-art approaches, e.g., SVseq3, across a wide range of parameter settings, such as read depth and sequencing error rate. delInsCaller often obtained higher F-measures than other approaches; specifically, it was able to maintain its advantages at ~15% sequencing errors. delInsCaller was also able to significantly improve the N50 values with almost no loss of haplotype accuracy compared with the existing approach.
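The abstract does not define the variation density score, so the following is only one plausible reading: the fraction of positions in a sliding window at which reads disagree with the reference. Isolated sequencing errors would then yield low scores and clustered differences high ones. The function name, window size, and toy positions are all assumptions made for illustration.

```python
from typing import List, Sequence


def variation_density(
    variant_positions: Sequence[int], region_length: int, window: int
) -> List[float]:
    """Return, for each window start, the fraction of positions flagged as variant.

    Positions are 0-based offsets within a region of `region_length` bases.
    """
    if not 0 < window <= region_length:
        raise ValueError("window must be between 1 and region_length")
    flags = [0] * region_length
    for pos in variant_positions:
        flags[pos] = 1
    current = sum(flags[:window])
    scores = [current / window]
    # Slide the window one base at a time, updating the running count.
    for start in range(1, region_length - window + 1):
        current += flags[start + window - 1] - flags[start - 1]
        scores.append(current / window)
    return scores


# Toy example (assumed): a cluster of differences at 2-4 and a lone one at 9.
scores = variation_density([2, 3, 4, 9], region_length=10, window=5)
print(scores)
```

Windows whose score exceeds a chosen threshold could then be treated as candidate mutational regions; what threshold or scoring delInsCaller actually uses is not stated in the abstract.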

Directory of Open Access Journals